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PsJ . Abstract 

fj ' Minimizing the discrepancy of a set system is a fundamental problem in combinatorics. One 

^~^ I of the cornerstones in this area is the celebrated six standard deviations result of Spencer (AMS 

1985): In any system of n sets in a universe of size n, there always exists a coloring which 

achieves discrepancy 6^/n. The original proof of Spencer was existential in nature, and did not 

give an efhcient algorithm to find such a coloring. Recently, a breakthrough work of Bansal 

(FOCS 2010) gave an efficient algorithm which finds such a coloring. His algorithm was based 

^^J • on an SDP relaxation of the discrepancy problem and a clever rounding procedure. In this work 

^_) I we give a new randomized algorithm to find a coloring as in Spencer's result based on a restricted 

tyj ' random walk we call Edge-Walk. Our algorithm and its analysis use only basic linear algebra 

O I and is "truly" constructive in that it does not appeal to the existential arguments, giving a new 

proof of Spencer's theorem and the partial coloring lemma. 

(N 
> 
C^ ■ 1 Introduction 

iy~. ' Minimizing the discrepancy of a set system is a fundamental problem in combinatorics with many 

• ■ applications in computer science (see |Mat99l ICha02j ). Here, we are given a collection of sets S 

f^ , from a universe V = {1, . . . , n} and the goal is to find a coloring x : 1/ — t- {1, —1} that minimizes 

CN I the maximum discrepancy xi'S) = rnaxgg^ | X^jgg x(^)l- We denote the minimum discrepancy of S 

by disc(5). 

There is by now a rich body of literature on discrepancy minimization with special focus on the 
^ I 'discrete' formulation described above. One of the cornerstones in this area is the celebrated six 

standard deviations result of Spencer |Spe85| . 



Theorem 1. For any set system {V,S) with \V\ = n, \S\ = m, there exists a coloring x '■ ^ ^ 
{1, —1} such that x{S) < Ky^n ■ log2{m/n), where K is a universal constant (K can he 6 ifm = n). 

One remarkable aspect of the above theorem is that for m = 0{n), the discrepancy is just 
0{y/n), whereas a random coloring has discrepancy 0{^/n\ogn). Spencer's original proof relied 
on an ingenious pigeon-hole principle argument based on Beck's partial coloring approach |Bec81j . 
However, due to the use of the pigeon-hole principle, the proof was non-constructive: Spencer's 
proof does not give an efficient (short of enumerating all possible colorings) way to find a good 
coloring x ^^ iii the theorem. This was a longstanding open problem in discrepancy minimization 
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and it was even conjectured that such an algorithm cannot exist |AS11] . In a recent breakthrough 
work, Bansal [BanlO] disproved this conjecture and gave the first randomized polynomial time 
algorithm to find a coloring with discrepancy 0{\/n ■ log(m/n)), thus matching Spencer's bound 
up to constant factors for the important case of m = 0{n). 

In this work we give a new elementary constructive proof of Spencer's result. Our algorithm 
and its analysis use only basic linear algebra and perhaps more importantly is "truly" constructive. 
Bansal's algorithm while giving a constructive solution, still implicitly uses Spencer's original non- 
constructive proof to argue the correctness of the algorithm. Our algorithm on the other hand also 
gives a new (constructive) proof of Spencer's original result. 

Theorem 2. For any set system (y,S) with \V\ = n, \S\ = m, there exists a randomized algorithm 
running in time 0((n+m)^)Q| that with probability at least 1/2, computes a coloring x '■ V ^ {^, —1} 
such that xi-S) < Ky^n • log2{m/n), where K is a universal constant. 

The constant K above can be taken as 13 for the case oi m = n. Observe that our bound 
matches Spencer's result for all ranges of m, n, whereas Bansal's result loses an additional factor 
of il(Y^log(m/n)). 

We also get a similar constructive proof of Srinivasan's result |Sri97j for minimizing discrepancy 
in the "Beck-Fiala Setting" where each variable is constrained to occur in a bounded number of sets. 
Bansal was able to use his SDP based approach to give a constructive proof of Srinivasan's result. 
Our techniques for Theorem [2] also extend to this setting matching the best known constructive 
bounds. 

Theorem 3. Let {V,S) be a set-system with \V\ = n, \S\ = m and each element ofV contained in 
at most t sets from, S. Then, there exists a randomized algorithm running in time 0((n + tti)^) that 
with probability at least 1/2 computes a coloring X • ^ ~^ {1)~1} such that xi-S) < K\/t ■ logn, 
where K is a universal constant. 

We remark that non-constructively, a better bound of 0{\/t ■ logn) was obtained by Ba- 
naszczsyk |Ban98| using techniques from convex geometry. Beck and Fiala |BF81] proved that 
disc(5) < 2t and conjectured that disc(5) = 0(vt) and this remains a major open problem in 
discrepancy minimization. 

2 Outline of Algorithm 

To describe the algorithm we first set up some notation. Fix a set system {V, S) with V = {1, . . . , n} 
and |5| = m. As is usually done, we shall assume that m > n ~ the general case can be easily 
reduced to this situation. Similar to Spencer's original proof our algorithm also works by first 
finding a "partial coloring" : x ^ ^ ~^ [~ 1 ) 1] such that 



• For aU S G 5, \x{S)\ = 0{yJn\og{m/n)). 

• \{i : |x(OI = 1}I ^ en, for a fixed constant c > 0. 

Given such a partial coloring, we can then recurse (as in Spencer's original proof) by running the 
algorithm on the set of variables assigned values in (—1,1) without changing the colors of variables 
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assigned values in {1,-1}. Eventually, we will converge to a full coloring and the total discrepancy 
(a geometrically decreasing series with ratio roughly ^/l — c) can be bounded by 0(-y/nlog(m/n)). 
Henceforth, we will focus on obtaining such a partial coloring. 

Let vi, . . . ,Vm S ffi" be the indicator vectors of the sets in S. Then, the discrepancy of x on S 
is xi'S) = niaxjg[„] | {x,Vi) |. Our partial coloring algorithm (as does Spencer's approach) works in 
the more general context of arbitrary vectors, and we will work in this general context. 

Theorem 4 (Main Partial Coloring Lemma). Let vi,. . . ,Vm G IR" be vectors, and xq G [—1, 1]" 
be a "starting" point. Let ci,...,Cm > be thresholds such that X^^i exp(— c^/16) < n/16. Let 
6 > be a small approximation parameter. Then there exists an efficient randomized algorithm, 
which with probability at least 0.1 finds a point x G [—1, 1]" such that 

(i) I {x-xo,Vj) I < Cj||i;j||2. 

(ii) \xi\ > 1 — 6 for at least n/2 indices i £ [n]. 

Moreover, the algorithm runs in time 0{{m + n)'^ • 5^"^ • \og{nm/8)). 

Note that the probability of success 0.1 can be boosted by simply running the algorithm multiple 
times. Given the above result, we can get the desired partial coloring needed for minimizing set 
discrepancy by applying the theorem to the indicator vectors of the sets S £ S with 6 = 1/n, and 
Xq = 0". Combining the above with the recursive analysis gives Theorem [2] with a running time of 
0{{n + m)^). It was pointed to us by Spencer that we can in fact take 6 = 1/logn and then use 
randomized rounding to get the running time stated in Theorem [2l 

We stress that Spencer's original approach shows the existence of a true partial coloring (the 
colors take values in {—1,0, 1}), whereas our approach gives a fractional coloring — the colors take 
values in [—1, 1] though many of the colors are close to { — 1, 1}. 

The constructive proof of Srinivasan's result. Theorem [3l follows a similar outline starting from 
our partial coloring lemma. We defer the details to Section [6l 

We now describe the proof of the partial coloring lemma. 

2.1 Partial Coloring by Walking on The Edge 

We will find the desired vector x by performing a constrained random walk that we refer to as 
Edge-Walk for reasons that will become clear later. 

We first describe the algorithm conceptually, ignoring the approximation parameter 6. We 
will assume throughout that ||fi||2 = • • • = ll^^mlb = 1 as this normalization does not change the 
problem. Consider the following polytope V which describes the legal values for x £ M", 

V ■.= {x£W : \xi\ < 1 Vi G [n], | (x - xq, Vj) \ < Cj Vj G [m]}. 

We will refer to the constraints \xi\ < 1 as variable constraints and to the constraints \ {x — xo,Vj) \ < 
Cj as discrepancy constraints. The partial coloring lemma can be rephrased in terms of the polytope 
V as follows: there exists a point x £ V that satisfies at least n/2 variable constraints without any 
slack. Lituitively, this corresponds to finding a point x in "P that is as far away from origin as 
possible; the hope being that if ||x||2 is large, then in fact many of the coordinates of x will be close 
to 1 in absolute value. We find such a point (and show it's existence) by simulating a constrained 
Brownian motion in V. (If uncomfortable with Brownian motion, the reader can view the walk as 
taking very small discrete Gaussian steps, which is what we will do in the actual analysis.) 



Consider a random walk in V corresponding to the Browninan motion starting at x = xq. 
Whenever the random walk reaches a face of the polytope, it continues inside this face. We 
continue the walk until we reach a vertex x € V. The idea being that we want to get away from 
origin, but do not want to cross the polytope - so whenever a constraint (variable or discrepancy) 
becomes tight we do not want to change the constraint and continue in the subspace orthogonal to 
the defining constraint. We call this random walk the "Edge- Walk" in V. 

By definition, the random walk is constrained to V, and | {x — xq,Vj) \ < Cj for all j G [m]. 
We show that as long as ^Q^p{—Cj) *C n, the random walk hits many variable constraints with 
good probability. That is, the end vertex x has Xi € { — 1, 1} for many indices. This step relies on 
a martingale tail bound for Gaussian variables and an implicit use of the ^2-iiorm as a potential 
function for gauging the number of coordinates close to 1 in absolute value. 

The actual algorithm differs slightly from the above description. First, we will not run the walk 
until we reach a vertex of V, but after a certain 'time' has passed, which will still guarantee the 
above conditions. Second, we will approximate the continuous random walk by many small discrete 
steps. 

3 Comparison with Entropy Method 

Here we contrast our result with Beck's partial coloring lemma |Bec81j based on the Entropy method 
which has many applications in discrepancy theory. While similar in spirit, our partial coloring 
lemma is incomparable and in particular, even the existence of the vector x as in Theorem U] does 
not follow from Beck's partial coloring lemma. 

We first state Beck's partial coloring lemma as formulated in |Mat98| . 

Theorem 5 (Entropy Method). Let {V,S) be a set-system with V = {1, . . . ,n}. Let A : 5 — ;■ M+ 
be such that "^g^g g{^s/\/\S\) < n/5, where g : R^ — )• M+ is defined by, 

{Ke-^'l\ X > 0.1 
^^ '~ |i^ln(l/A), A<0.1' 

where K is an absolute constant. Then, there exists x £ {"l^O, 1}" with \{i : Xi 7^ 0}| > n/2 such 
that I Y^idS X«l ^ ^5" /o'" ^'"(^'^y s gS. 

By applying our Theorem[5]to the indicator vectors of the sets in S and 5 = l/poly(n) sufficiently 
small we get the following corollary. 

Corollary 6. Let {V,S) be a set-system with V = {1, . . . ,n}. Let A : 5 — t- M+ be such that 

J]exp(-A|/16|5|)<n/16. 

ses 

Then, there exists x £ [~lil]" with \{i : \xi\ = 1}| > ri/l, such that \ YliesXil ^ ^s + l/poly(n), 
for every S G S. Moreover, there exists a randomized poly{\S\, n)-time algorithm to find x- 

The above result strengthens the Entropy method in two important aspects. Firstly, our method 
is constructive. In contrast, the entropy method is non-constructive and the constructive discrep- 
ancy minimization algorithms of Bansal do not yield the full partial coloring lemma as in Theorem[5j 



Secondly, the above result can tolerate many more stringent constraints than the Entropy method. 
For instance, the entropy method can only allow 0{n/logn) of the sets in S to have discrepancy 
1/n, whereas our result can allow ri(n) of the sets to have such small discrepancy. We believe that 
this added flexibility in achieving much smaller discrepancy for a constant fraction of sets could be 
useful elsewhere. 

One weakness of Theorem |4] is that we do not strictly speaking get a proper partial coloring: 
the non {1,-1} variables in our coloring x can take any value in (—1, 1). This however does not 
appear to be a significant drawback, as Corollary [6] can also be made to work from an arbitrary 
starting point xq as in the statement of Theorem [H 

4 Preliminaries 

We start with some notation and few elementary properties of the Gaussian distributions. 

4.1 Notation 

Let [n] = {1, . . . , n}. Let ei, . . . , e„ denote the standard basis for M". We denote random variables 
by capital letters and distributions by calligraphic letters. We write X ^ D for a random variable 
X distributed according to a distribution D. 

4.2 Gaussian distribution 

Let A/'(/i,o"^) denote the Gaussian distribution with mean ^ and variance a^. A Gaussian distri- 
bution is called standard if ^u = and a'^ = 1. If Gi ~ A/'(/ii,(T^) and G2 ~ A/'(/i2,o"2) then for 
ti,t2 E M we have 

tiGi + taGa ~ AA(ti/ii + t2/x2, tfcr? + t^a^). 

Let V C M" be a linear subspace. We denote by G ~ MiV) the standard multi-dimensional 
Gaussian distribution supported on F: G = GiVi + . . . + GdVd, where {vi, . . . , Vd} is an orthonormal 
basis for V and Gi, . . . ,Gd ~ AA(0, 1) are independent standard Gaussian variables. It is easy to 
check that this definition is invariant of the choice of the basis {vi, . . . ,Vd}- We will need the 
following simple claims. 

Claim 7. Let V Q W^ be a linear subspace and let G ~ J^{V). Then, for all u G M", {G,u) ~ 
7V(0, cr^), where a"^ < WuW^. 



r\j 



Proof. Let G = Givi + . . .+GdVd where {vi, . . . , Vd} is an orthonormal basis for V and Gi, . . . ,Gd 
A^(0, 1) are independent. Then {G,u) = ^j=x {u,Vi) ■ Gi is Gaussian with mean zero and variance 

Eti(«>^^)'<ll"lli- a 

Claim 8. Let V Q R"- be a linear subspace and let G ~ ^J'{V). Let {G,ei) ~ M{0,af). Then 

Er=i<x,2 = dim(y). 



r\j 



Proof. Let G = GiVi + . . ■ + GdVd where vi, . . . ,Vd are an orthonormal basis for V and Gi, . . . ,Gd 
AA(0,1) are independent. Then, Er=i '^f = EtiM (G,e,) p] = E[||G||1] = Eti \Ml-E[G^] = 
d = dim(y). D 

The following is a standard tail bound for Gaussian variables. 



Claim 9. Let G ~ iV(0, 1). Then, for any A > 0, Pr[|G| > A] < 2exp(-A2/2). 

We will also need the following tail bound on martingales with Gaussian steps. It is a mild 
generalization of Lemma 2.2 in [BanlOj and we omit the proof. 

Lemma 10 ( [BanlOj ). Let Xi, . . . ,Xt he random variables. Let Yi, . . . ,1t be random variables 
where each Y^ is a function of X^. Suppose that for all 1 < i < T, xi, . . . ,Xi_i G M, l^KXi = 
3;i,X2 = X2, ■ ■ ■ ,Xi^i = Xj-i) is Gaussian with mean zero and variance at most one (possibly 
different for each setting of xi, . . . , xi^i). Then for any A > 0, 

Pr[|yi + ... + Yt\> X^/f] < 2exp(-AV2). 

5 Main Partial Coloring Lemma 

We are now ready to present our main partial coloring algorithm and prove Theorem HI We shall 
use the notation from the theorem statement and Section 12. li 



Let 7 > be a small step size so that 6 = 0{'yy^log{nm/j)). We note that the correctness of 
the algorithm is not affected by the choice of 7, as long as it is small enough; only the running time 
is affected. 

Let T = i^i/7^, where Ki = 16/3. We assume that 6 < 0.1. The algorithm will produce 
intermediate steps Xq = xq,Xi, . . . , Xt G M" according to the following update procesci 

Edge- Walk: For t = 1, . . . , T do 

• Let CJ^^ := CJ^^{Xt-i) = {i £ [n] : \{Xt-i)i\ > 1-5} he the set of variable constraints 'nearly 
hit' so far. 

• Let Cf'''' := Cf''''{Xt-i) = {j £ [m] : \ {Xt-i - xq,vj) \ > cj - 5} be the set of discrepancy 
constraints 'nearly hit' so far. 

• Let Vt := V(Xi„i) = {u e W : Ui = yi e C^', {u,Vj) = Vj G Cf'"'} be the linear 
subspace orthogonal to the 'nearly hit' variable and discrepancy constraints. 

• Set Xt := Xt-i + 7C/t, where Ut ~ M{Vt). 

The following lemma captures the essential properties of the random walk. 

Lemma 11. Gonsider the random walk described above. Assume that Xl^i 6^P(~'^?/16) < n/16. 
Then, with probability at least 0.1, 

i. Xo,...,XTer. 

2. \[Xt)%\ > 1 — 5 for at least n/2 indices i G [n]. 



"^We call the random walk "Edge- Walk" because geometrically, once the walk (almost) hits an edge (face) of the 
polytope V, it stays on the edge. 



Theorem H] follows immediately from Lemma [TT] by setting x = Xt- Note that computing 
^var^^disc^ given Xt-i takes time 0{nm). Further, once we know the set of constraints defining Vt, 
we can sample from Af{Vt) in time 0{{n + m)^) by first constructing an orthogonal basis U for Vt 
and setting Ut = '^ueu ^uU-, where Gu ^ J^ are chosen independently. 

We prove Lemma [11] in the remainder of this section. We start with a simple observation that 
^var^^disc ^^^^ only increase during the random walk. 

Claim 12. For all t < T we have Q"'' C Q^\ and Cf'"" C Cf^^. In paHicular, for 1 < t < T, 
dim{Vt) > dim{Vt-\-i) ■ 

Proof. Let i £ CJ^^ . That is, |(X(_i)j| > 1 — 5. Then by definition of the random walk, Ut G Vt 
and (Ut)i = 0. Thus, {Xt)i = {Xt-i)i and i G CJ^\. The argument for discrepancy constraints is 
analogous. D 

We next show that the walk stays inside V with high probability. 



Claim 13. For 7 < bj \JC log(mn/j) and C a sufficiently large constant, with probability at least 
l-l/{mnf-^Xo,...,XT€V. 

Proof. The proof involves a simple application of the tail bound from Claim[9j Clearly Xq = xq £ V. 
Let Et := {Xt ^ VIXq, . . . ,Xt^i £ V} denote the event that Xt is the first element outside V, so 

Pr[Xo, ...,XTeV] = l-El=i P^Et]. 

In order to calculate Pr [£^t] , note that if Et holds then Xt must violate either a variable constraint 
or a discrepancy constraint. Assume for example that Xt violates a variable constraint, say {Xt)i > 
1. Since Xt-i £ V we must have {Xt-i)i < 1. However, we we must in fact have |(Xi_i)i| < 1 — 6 
as otherwise we would have i G CJ^'^ and hence {Ut)i = and {Xt)i = {Xt-i)i. Thus, in order for 
this situation to occur we must have that |(C/t)i| > §/j. We will show this is very unlikely. 

Let W := {ei, . . . ,en,vi, . . . , Vm}- We conclude that if Et holds then | {Xt — Xt-i,w) | > (5 for 
some w G W. That is, | {Ut,w) \ > 6/j. We next bound the probability of these events. Since 
Ut ~ AA(Vt) we have by Claim [7] that {Ut,w) is Gaussian with mean and variance at most 1. 
Hence by Claim [9l 

Pr[| {Ut,w) I > V7] < 2exp(-(V7)V2). 



By our setting of parameters 6/j = y/Clog{nm/^)) and T = 0(1/7^). Thus, 

T T 2-1 



(mn)'-^ imn)'^ ^ ' 
t=l t=l w€W ^ ' ^ ' 

for C large enough. D 

We are now ready to prove Lemma [TTl The intuition behind the proof is as follows. We first 
use the hypothesis on the thresholds Cj,j G [m], to argue that E[ \C^^'^\ ] <C n. This follows from the 
definition of the walk and a simple application of the martingale tail bound of Lemma [TOj Note that 
to prove the lemma it essentially suffices to argue that E[|CJ?''^|] = J7(n) (we can then use Markov's 
inequality). Roughly speaking, we do so by a "win-win" analysis. Consider an intermediate update 
step t < T. Then, either |C™^| is large, in which case we are done, or |C™^| is small in which 
case di7n{Vt-i) is large so that E[||Xt|p] increases significantly (with noticeable probability) due 
to Claim [HI On the other hand, H-'^ilP < n as all steps stay within the polytope V (with high 

7 



probability). Hence, \CJ^'^\ cannot be small for many steps and in particular \C^'^\ will be large with 
noticeable probability. 

We first argue that E[|C^"^^|] is small. That is, on average only a few discrepancy constraints 
are ever nearly hit. 

Claim 14. E[|C;^*"'=|] < n/4. 

Proof. Let J := {j : Cj < 106}. To bound the size of J, we have 

n/16 > ^exp(-c^/16) > \J\ • exp(-100(5Vl6) > \J\ • exp(-l/16) > 9|J|/10, 

and hence \J\ < 1.2n/16. Now, for j ^ J, if j G C^''^^ then | {Xt - xo,Vj) \ > Cj - 6 > 0.9cj. We 
will bound the probability that this occurs. Recall that Xt = xq + ^(Ui + . . . + Ut) and define 
Yi = {Ui,Vj). Then, for j ^ J, we have 

Pr[j G C^'^=] < Pr[ m + . . . + Yt\ > 0.9cj/j]. 

We next apply Lemma [101 Note that the conditions of the lemma apply, since Ui, . . . , Ut is a 
sequence of random variables, 1^ is a function of Ui and Yi\{Ui, . . . , C/i-i) is Gaussian with mean 
zero and variance at most one (by Claim [7]). Hence, 

Pr[j E Cf""^] < 2exp(-(0.9cj)V27^T) = 2ex.p{-{0.9cjf /2KiT) < 2ex.p{-c]/l6). 

So 

E[\Cf'^\] < \J\ + J^Pr[i G 4'^^] < 1.2n/16 + 2n/16 < n/4. 

D 
Claim 15. ]E[||Xt||2] <n. 

Proof. We will show that E[(Xt)?] < 1 for all i G [n]. Conditioning on the first t for which i G C™'' 
(or that no such t exists), we get 

T 

e[(Xt)?] = Pr[z ^ cn n{XT)l\i i CT\ + Y. P^[^ ^ cr \ CTA\ ie[(Xt)?|z g cr \ CTA\. 

t=l 
Clearly E[(Xt)- |i ^ CJ?'] < 1. For t < T, we have 

E[{XT)h^ G cr \ Cri] = E[(^t)'|i e cr \ Cri] < l - ^ + 7E[|(f^t)d^] < 1, 

where we used the fact that {Ut)i is a Gaussian variable with mean zero and variance at most one 
(by Claim [T]). D 

Finally, we show that E[|C™''|] is large. That is, on average we will nearly hit a constant fraction 
of the variable constraints. 

Claim 16. E[|C^''''|] > O.SGra. 



Proof. We start by computing the average norm of Xf. 

E[||Xt||2] =E[||Xi_i +7C/J2] =E[||X,_i||2] +72]E[||f/^||2] ^E[||Xt_i||2] + 72E[dini(Vt)], 

where we used that fact that given Xj_i, E[t/t|Xt_i] = and E[||[/4||2|Xi_i] = dim(Vt), by Claim[8l 
Hence, by Claim \T5\ 



n > E[||Xt||2] > 7^^E[dim(Vt)] > 7^|r|-E[dim(VT)] = is:rE[dim(VT)] = KiK[{n-\C^'''\-\Cf' 



->disc| 
t=l 

Therefore, E[|CJ?''|] > n(l - 1/Ki) - E[|C^''''=|] > n(l - 1/Ki - 1/4) > (0.56)n, where the second 
inequality follows from Claim [TH D 

Lemma [TT] now follows immediately from Claim [T3] and Claim \W[ 

Proof of LemmalTJl From Claim [Hand the fact that |CJ^''| < n, it follows that P[|CJ?''| > n/2] > 
0.12. Combining with Claim [T3l with probability at least 0.12 - l/poly{m,n) > 0.1, jC™''! > n/2 
and Xt G V which shows the lemma. D 

6 Discrepancy Minimization from Partial Coloring 

We now derive Theorem [2] and Theorem [3] from our partial coloring lemma. 

Proof of Theorem\^ Let iV^S) be a system with \V\ = n and |5| = m. Let vi,...,Vm G K" 
be the indicator vectors of the sets in S. We set 5 = 1/(8 log m). Let a{m,n) = 8y/log{m/n). 
Then, m ■ ex.p{—a{m,n)'^/16) < n/16. Therefore, by Theorem U] applied to vi,. . . ,Vm and starting 
point xo = 0", with probability at least 0.1 we find a vector xi G [—1, 1]" such that | {vj,xi) \ < 
y/n ■ a{m,n) for all j £ m and \{i : |(a;i)i| > 1 — (5}| > n/2. We can boost this probability further 
by repeating the algorithm O(logn) times; from now on we will ignore the probability that the 
algorithm does not find such a vector. 

Let Ii = {i : |(xi)j| < 1 — 6} he the coordinates not 'fixed' in the first step and set ni = 
|Ii|. We now iteratively apply Theorem H] to the restricted system described by the vectors vl = 
{vi)i-^, . . . ,v^ = {vm)ii £ K*^^ and starting point (xi)/^ to get another vector X2 S [—1, 1]"^ such 
that |(t;j,X2)| < ^/nl ■ a{m,ni) for all j G [m] and \{i : \{x2)i\ > ^ — S}\ > ni/2. By iterating this 
procedure for at most t = 21ogn times and concatenating the resulting vectors appropriately we 
get X £ R"" such that |xj| > 1 — 6 for all i G [n] and for every j £ [m], 

I {vj,x) I < ^/n • a{m, n) + ^/nl ■ a{m, ni) + • • • + y/nl ■ a{m, nt) 
8y/log(m-27n) 



<v^E 



r=0 



2r/2 



< C\Jn ■ log{m/n), 

for C a universal constant. 

We now round x to get a proper coloring x G {1, — 1}". Let x G {1, —1}" be obtained from x 
as follows: for i G [n], Xi = sign{xi) with probability (1 + |xj|)/2 and —sign{xi) with probability 



(1 — |2;j|)/2, so that E[xj] = Xj. Let y = x — x. Fix some j G [m]. Then, the discrepancy of x with 



I ix, Vj) \<\{x, Vj) \ + \{Y, Vj) I < Cy/nlog{m/n) + | {Y, Vj) |. 

We will show that with high probability, | {Y,Vj) \ < y/n for all 1 < j < m. Fix some j E [m] and 
consider {Y,Vj). We have that \Yi\ < 2, K[Yi\ = and Var(yj) < 6. We also have ||fj||2 < \/n and 
ll'i'jlloo < 1- Thus, by a standard Chernoff bound (see e.g., Theorem 2.3 in |CL06] ). 

{Y,Vj) I > 2 a/2 log m • \/n^ < 2exp(-21ogm) < l/2m. 

Therefore, by the union bound and our choice of 5, with probability at least 1/2 we have that 
I (y, Vj) I < ^/n for all 1 < j < m. Therefore, | {x, Vj) \ < Cy^nlog{m/n) + ^/n for all 1 < j < m. 

The running time is dominated by the 0(log n) uses of Theorem [H Thus, the total running 
time is 0{{n + m)'^ log {mn)) = 0{{n + m)'^). D 

The constant in the theorem can be sharpened to be 13 by fine tuning the parameters. We do 
not dwell on this here. We next prove Theorem El 

Proof of Theorem The proof is similar to the above argument and we only sketch the full proof. 
Set 6 = 1/n. Let {V,S) be the set system. Let vi,. . . ,Vm be the indicator vectors of the sets in 
S and let Cj = CVi/\\vj\\2 for C to be chosen later. Observe that ^- ||t'j||| < nt as each element 
appears in at most t sets. In particular, the number of vectors Vj with ||fj||2 in [2^t,2'^^^t] is at 
most n/2^. Therefore, 

j r=0 

for C a sufficiently large constant. Thus, by applying Theorem U] to the vectors Vj and thresholds 
Cj for j £ [m], with probability at least 0.1 we get a vector xi G [—1, 1]" such that | {vj,xi) \ < C\ft 
for ah j G [m] and \{i : \(x\)i\ >\-S]\> n/2. 

By iteratively applying the same argument as in the proof of Theorem 2] for 2 log n steps, we 
get a vector x E [—1, 1]"" with \xi\ > 1 — 5 for all i and | {vj,x) \ < 2C\/tlogn for all j £ [m]. 
The theorem now follows by rounding the x to the nearest integer coloring x'- Xi = sign{xi) for all 
i G [m]. n 
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